Figure A

cDNA sequence of SEQ ID NO: 2076 with the start and stop codons in bold and boxed, and the intron
splice sites indicated with //. The three exons are underlined. The 5' and 3' untranslated regions are in
italics.

GAAAAGGGCCGTGGACGATGGAAGAGCATCTCATCCTCAT^^ 
CCCAAACCTG<rreG//TCTAAAACGTACC 
GGCAACATCACTACTGAGGAGCAGCTCCTC 
^TgTTCCCGGAAGGACTGAC&ATGAGATAAAOAACITC^ 

C T CTGQT CAQAGCTCCGAGATGAG T GAT CAAGCAAG CAC^^GC CACATOTCCAGCATG CC AGAGCCGATGGAGACXTTACGACT 

CACCGCCGTCaTTCCAAGGCAACAACAACATGGAGCCTTTGCCGGTGAATTTO 

ATGGACX2ATCTTTGGTGTATGCAGTTACTCAA 

TAC^TAATAGCftACl GriTCfriTA^ 

QAAG TTA TCTACAAA TATGTGCATGAG TTG TAAA CGAAACTA CCA TCTGCAGTTTGCATCCCCGCTA TQTAATGACTGAAA TA 
ATGAAGCGAGATTATTTGGCTTAAAAAAAAAAAA 
Figure B

CLUSTAL X (1.8) multiple sequence alignment of the amino acid translation of SEQ ID
NO: 2076 with the poplar gene prediction using Twinscan (Washington University) with
a locally assembled Poplar contig from Phrap (University of Washington) alignment
of genomic Poplar sequences (Populus balsamifera) made available by the Department
of Energy Joint Genome Institute (University of California) and A. thaliana
homologues (The Arabidopsis Information Resource (TAIR) Accession Nos. At5g40350.1
and At3g01530.1).

Poplar MDKS P cnsqdvbvrkgpwtleedliltnyianhgegvwnslakaaglk 

SEQ NO 2076 MDKK PDDDSGKSQtJVEVRKGPWTMEEDLI L INY I ANHG&GSWNSIxAKAAGLK 

At5g40350.1 MEKR ESSGGSGSGDAEVRKGPWTMEEDLILINYIANHGEGVWNSLAKSAGIjK

At3g01530.1 METTMKKKGRVKATITSQKEEEGTVRKaPWTMEKDFILFNYILNHGECfL

*_* . • *#**♦**. *** l ** *** ***** #+*.*+;.++* 

Poplar RTGKSCRLRWLNYLRPDLRRQliITTPEEQLLIMEIiHAKLaHRWSKI AKHLPGRTDNEIiCNY

SEQ NO 2076 RTGKS CRLRWLNYLR PD VRRGNI TTEEQIiL I MELHAKWGNRWS KIAKHL PGRTDfl E I KNF

At5g40350.1 RTG^CRLRWLNYLRPDVHJlGOfcTITPEEQLTIM^
At3g01530.1 RTGKSCRI^SfliNYLRPDVRRGNITEEEQLLIIQIJlAKl/SN^

a**********-****** . ****** **** *..**«* ********************* : 

Poplar WRTRIKKHTK OTEPPAAGSSETNEHGSSTCQVS- SATDQMETY-CPPFYOjG- -0V-G 

SEQ NO 2076 WRTRIQKHIK- - -QAEAFSGQSSEMSDQ-ASTSEMS-SMPBPMETYDSPPSPQGNNNM-E 

At5g40350.1 WRTKI QKYI IKSGETTTVGS QSSEFINHHATTSHVMNDTQBTMDMYS PTTS YQHASN INQ.

At3g01530.1 WRTKI QRHMK - - VS SENMMNHQHHCSGN - SQS SGMT- TQGSSGKAI DTAES FSQAKTT - -



Poplar AFSGGN - -IPQBLNE-NYWSMEDLWSMQLLNGD 

SEQ NO 2076 PLP-VN LSVESNE-AYWSMDDLWSMQLLNGD 

At5g40350.1 QLNYGNYVPESG3 IMMPLSVDQSEQNTWSVDDLWPMNI YNGN

At3g01530.1 TFN vVEQQSNE-NYWNVEDLWPVHLLNGDHEVI

::.***.:: *** . : : i ** r 
Figure C



Alignment of cDNAs of SEQ ID NO: 2076, A. thaliana At5g40350, Poplar spp. 

Predicted intron/exon boundaries (underlined and bold), based on the Arabidopsis
thaliana At5g40350 splicing sites and confirmed from a gene prediction using
Twinscan with a locally assembled Poplar contig from Phrap alignment of genomic
Poplar sequences (Populus balsamifera) made available by the Department of Energy
Joint Genome Institute, are identified for poplar and SEQ ID NO: 2076. The splice
sites are indicated by //.

CLUSTAL X (1.8) multiple sequence alignment 



SEQ ID NO 2076 ATGGACAAGAAGCCMACGACGACAGT 

Poplar ATGGATAAAAGTCCATGCA- - -AC- - - 

At3g01530.1 ATGGAGACGACGATGAAC3AAGAAAGCK3AGAGTGAAAGCGACAATAACGTC

At5g40350.1 ATGGAGAAAAGAGAAAGTAGTGGTGG-

* *** * 

SEQ ID NO 2076 G - GTAAGTCCCAAOATGTCGAGGTGAGAAAAGGGCCGTGGACGATCGAAG 

Poplar TCTCAGGATGTTGAAGTGAGAAAAGGGCCATGGACCTTGGAAG 

At3g01530.1 ACAGAAAGAAGAAGAAGGi^CAGTGAGAAAAGGACCTTGGACTATGGAAG
At5g40350.1 GTCTGGATCAGGAGATGCAGAGGTGAGAAAAGGGCCATGGACGATGGAAG

** * *********** ** ***** ****** 

SEQ ID NO 2076 AGXSATCTCATCCTC^TCAACTACATAGC^^

Poplar AAGACTTGATC TTAAC CAACTACATCGCGAAC CATGGTGAAGGTGTATGG

At3g01530.1 AAGATTTCATCCTCTTTAATTACATCCrTAATCATGGTGAAGGTC'i'lTGG

At5g40350.1 AAGAl^TGATTCTCATCAATTATATCKCCAATCATGGTGAAGGTGTTTGG

******* ** ** ** »* ** ** ***** *** 

SEQ ID NO 2076 AACTCCCTAGCC AAAG CTGCTG//QTCT AAAA CGTACCGGGAAGAGTTGTCG

Poplar AACTCGCTTGCAAAAG CTGCAG//QTCTGAAAC GTACCGGAAAOAGTT QTAQ

At3g01530.1 AACTCTGTCGCCAAAGCCTCTG//GTCTAAAACGTACTGGAAAAAGTTGTCG

At5g40350.1 AACTCTCTCGCCAAAT CTGCAG//gactaaaac gcaccgggaaaaqttgccg

***** * ** *** * * * * ** ***** ** ** ** ***** * 

SEQ ID NO 2076 GCTCCGGTGGCTGAACTATCTGCGACCCGACGTCCGGAGAGGCAACATCA 
Poplar GCTCQ3TTGG<^CAACTACrTGC3GGCCnX^CCTTCGAAGAGGGAATATTA 
At3g01530.1 GCTCCGGTGGCTGAACTATCTCCGACCAGATGTGCGGCGAGGGAACATAA
At5g40350.1 GCTCCGGTGGCTGAACTACCTCCGACCTGATGTGCGACGGGGAAATATCA

****** ***** ***** * *-* ** ** * ** * ** ** ** * 

SEQ ID NO 2076 CrTACTGAGGAGCAGCTCCK5ATCATGGAACTG<!ATGCCAAGTCGGGAAAC 
Poplar GTCCTGAAGAACAQCTCTTQATCATGGAACTG CATGCTAAG TTGGGAAftC 

At3g01530.1 cx:gaagaagaac^gcttttgatcattcagcttcatgctaagcttggaaac
At5g40350.1 caccagaagaacagctcaccatcatggaacttcatgcaaaatggggaaat

* *• ** ***** ***** * ** ***** ** ***** 

SEQ ID NO 2076 AG/^ajM^TCTAAAATTGCAAAGGAO'CTTCCCGGAAGGACTGACAATGAGAT 
Poplar AG/ / gTGGTC GAAAATTGGAAAGCATCTTCCAGGGAGGACCnArAArn&rt^T 

At3g01530.1 AG//GTGGTCmAGATTGCGAAGCATCTTCCGGGAAGAACGGACAACGAGAT

At5g40350.1 ag//gtg^tcaaaaattocaaagcatttaccaggaagaaccgacaatcagat
** ****** ** ***** ****** * ** ** ** ** ***** ***** 



SEQ ID NO 2076 aaagaacttctggaggactagaatccaaaagcacatcaagcaagcacagg
Poplar aaagaattactgqaqaactagaat caag aag c atact aagcaaac tgaac
At3g01530.1 aaagaacttctooa^acaaaoattcajgagacacatgaaagtgtcatcgg
At5g40350.1 AAAOAATTTTTGGAGaACTAAGATCCAGAAATACATC^TCAAGAGCGGAG
****** * ***** ******* * * * 



SEQ ID NO 2076 CTTTCTCTGGTC AGAOCTCCGAGATGAGTGATCAAGCAAO -
Poplar CATTTGCGGCGG GGAGTTCTGAGACTAATGAACATQGGAG -
At3g01530.1 AAAATATGAT GAATCATCAACATCATTGTTCGGGAAACT
At5g40350.1 AAACGACGACCGTTGGATCAO\AAGCTCCGAGTTTATAAACCaTCATGC -
* * * • 



SEQ ID NO 2076 CACAA- - -GCCACATCTCGAGCATGC- - CA- GAGCCGATGGAG ACCTACG
Poplar CAGTACTTGCCAAGTTrCCAGCXSCAA' - CC-GACCAAATGGAGACCTATT
At3g01530.1 c^cagagctcggggatgacgaogcaaggca-gctccggcaaagccataga
At5g40350.1 gacaaogagccatatc^tgaatgatactcaagaaaccatggatatgtatt
* * * * * »* 



SEQ ID NO 2076 ACTCACCGCCG- -TCATTCCAA GGCAACAACAACATGGAGCCTT
Poplar GTCCACCA TTCTATCAA GGAGACGTA- GGGGCTTT
At3g01530.1 CAGGGCTGAGAGCTTCTCTCAG GCGAAGACGA- - -CGACGTTTA
At5g40350.1 CTCCAACGACG- - TCGT ATCAACATG CCAGCAAT ATTAATCAGCAGC TTA
* * *★ * * * * 



SEQ ID NO 2076 -TGCCGGTG- - AATTTGTCGOTCG
Poplar -TTCTGGTGGGAATATACCTCAAG
At3g01530.1 - ATGTGGTGG - AA - - CAAC
At5g40350.1 ATTATGGTAATTATGTGCCTGAATCCGGTTCX1ATCATGATGCCATTATCT



SEQ ID NO 2076 - AGTCAAATGAAGCCTACTGX3AGCATGGACGATCTTTGGTCTAT
Poplar AACOtSAACGAAAACTATTGGAGC^TGGAGGATGTCTGGTCCAT
At3g01530.1 AGTCaAACGAGAATTACTGGAACGTTGAAGATCTGTGGCCCGT
At5g40350.1 GTTGATCAATCCGAACAAAACTATTGGAGCGTCGATGATCTTTGGCCCAT
* * * •* **** * * ** ***** *** * * 



SEQ ID NO 2076 GCAGTTACTCAATGGGGATTGA
Poplar GCAACTACTTAATGGCGAT . , - - ~ _
At3g01530.1 C CACTTGCTT AATGG TG AC CAC CATGTGATTT AA
At5g40350.1 GAAT ATAT AT AATGG TAATTAA

* * ***** * 
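
The // splice-site convention used in the alignment above can be read mechanically. The following is a minimal sketch, assuming the annotated sequence is available as a plain string; the function names and the toy sequence are hypothetical and are not taken from SEQ ID NO: 2076 or from the method used to prepare the figure.

```python
def split_exons(annotated: str, marker: str = "//") -> list[str]:
    """Split a marker-annotated cDNA string into its exon segments.

    Whitespace is removed first, so a marker that was broken up by
    spaces (for example "/ /") is still recognised.
    """
    cleaned = "".join(annotated.split())
    return cleaned.split(marker)


def junction_positions(annotated: str, marker: str = "//") -> list[int]:
    """Return the 0-based offset, in the spliced sequence, of each exon junction."""
    positions = []
    offset = 0
    for exon in split_exons(annotated, marker)[:-1]:
        offset += len(exon)
        positions.append(offset)
    return positions


# Hypothetical toy input, for illustration only.
toy = "ATGGCT//GGTCTA AAA/ /TGA"
exons = split_exons(toy)
junctions = junction_positions(toy)
print(exons, junctions)
```

Removing whitespace before splitting is a deliberate choice here, since several rows of the alignment show the marker as "/ /".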
Figure D



Results from a BLASTX homology search using SEQ ID NO 2076:

At5g40350.1 myb family transcription factor / similar to Myb26
GI:1841475 from [Pisum sativum]; supported by
full-length cDNA: Ceres: 262460.
Length = 214

Score = 246 bits (606), Expect = 6e-64

Identities = 125/209 (59%), Positives = 146/209 (69%), Gaps = 32/209 (15%)

Query: 8 SQDVITVRFEPWTljEiaJIJLTNYIJUraGB 67 

S D EVRKGPWT+HEDLIL OTIANHCTOT1ffiSIJW4^ACXJCRTQK5CI^WUJyi£PD-t-K 
Sbjct: 12 SOTAEVRFEFWTMBBDLILINYIANHG^GVWNew^ 71

Query: 68 rg^TPEBQLLIMEISTUCLG^WSKXAKHLPGRTIJNEIKNTHRTRXKKH TKQTBPFA 124

RGSITPEEQL IMElJiAK SHRMSK1AXHLPGRTDMHIKM+MBT+1+K+ 4- +T 
Sbjct: 72 RfmTPEEQLTIMKIJIMaKSimWSKIAlO^^ 121

Query: 125 AQSSBTWBHGSSTCCySSATDQ-KBTYCP-PPYQGDVGAPSQGNiegiSUI — - 172

+ SSB H++TV+T+M+YP YQ HI Q+U9 

Sbjct: 132 SQSSEPIMHHATTSHVMHDTQBTJfflMYSPTTSYQ HASNINQQI2JYCNYVPESGS 185

Query: 173 EHYWSMEDLWSMQLLNGD 190

+HYWS++DDW » + NG-h 
Sbjct: 186 IHKPLSVDQSBQNyWSVDDLHPHNIYNGH 214



At3g01530.1 myb family transcription factor / contains PFAM
profile: myb DNA binding domain PF00249 / supported by
full-length cDNA: Ceres: 94595.

Length = 206
Score = 232 bits (5se), Expect = ae-62

Identities = 118/196 (60%), Positives = 130/196 (70%), Gaps = 13/196 (6%)

Query: 3 KSPOJSQDVE VRKOPVTLEEDLIUn^ANHGBGVHH3lAKAAGUCRTC 59

K+ SQ B VRKGPWT+EBD IL flYI NHGEG+MNS+AKA+5IiKRTGKSCRTiHWL 
Sbjct: 12 KATITSQKEEEGTVRKGPWTMEBDFILiPiraiLNra 71 

Query: 60 HYIJ^DIJUWOTTPEEQLLIMELHAKLt^^ 119

mfliRPD+RRONIT BBQIiLI ++LHAKLGHRWSltlAJCroiPGKTDIJSI KN+WRT+I+ +H EC 
Sbjct: 72 imiRFDVRRGtrcTBEfiQXiLIIQLHAlCIjI^WSCT^ 131

Query: 120 TEPFAAOSBETOEHCSSTCQVBBATDQMBT YCPPFYQGnvSAPSOONIPQBLNBN 174

+ + H 3 Q 3 T Q + FQ p + + NEW 

Sbjct: 132 6 SENMMNHQHhCSGHS QSSaMTTQQS SSKAIDTABSPSQAKTITKN — WEQQSNEN 186

Query: 175 YVf&MEDLWSMQLLNGD 190

YW++EDLW + LUh$D 
Sbjct: 187 YXHVEDLWPVHLLKGD 202
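
The percentages reported with each hit follow from the match counts and the alignment length. For the first hit, 125/209 is about 59.8% but is reported as 59%, which is consistent with truncation rather than rounding. The following is a minimal sketch under that assumption; the helper name is hypothetical and is not part of any BLAST distribution.

```python
def blast_percent(count: int, alignment_length: int) -> int:
    """Percentage of alignment columns, truncated to a whole number.

    Assumes the reported value is floor(100 * count / length), which
    matches the figures quoted for the first hit above.
    """
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return (100 * count) // alignment_length


# Counts quoted for the first hit (alignment length 209).
identities = blast_percent(125, 209)
positives = blast_percent(146, 209)
gaps = blast_percent(32, 209)
print(identities, positives, gaps)
```

Integer arithmetic is used so that no floating-point rounding enters the comparison with the reported values.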
Figure E



Phylogenetic comparison of SEQ ID NO: 2076 with A. thaliana homologues At3g01530.1
and At5g40350.1, and Poplar (P. balsamifera) gene prediction using Twinscan
(Washington University) with a locally assembled Poplar contig from Phrap
(University of Washington) alignment of genomic Poplar sequences made available by
the Department of Energy Joint Genome Institute (University of California). This
tree was generated using NJPlot.